Indicators Ratings Parallel =  8 ms
Indicators Ratings =  2 ms
CPU compute ratings: 1 iteration avg time: 664 ms
 
 ---- 
cuda_malloc<d_ratings> 2.29
cuda_malloc<d_companies_map> 0.46
cuda_malloc<d_indicatorsRatings> 3.68
kernel<computeRatingsKernel> 0.64
kernelSegment<sort> 1.78
toHostMemory<d_ratings> 0.37
toHostMemory<d_companies_map> 0.4
toHostMemory<d_indicatorsRatings> 7.18
ranking 0.57
TotalRankingTime 18.84

cuda_malloc<d_ratings> 0.27
cuda_malloc<d_companies_map> 0.09
cuda_malloc<d_indicatorsRatings> 0.1
kernel<computeRatingsKernel> 0.62
kernelSegment<sort> 0.52
toHostMemory<d_ratings> 0.4
toHostMemory<d_companies_map> 0.35
toHostMemory<d_indicatorsRatings> 8.05
ranking 0.62
TotalRankingTime 11.3

cuda_malloc<d_ratings> 0.29
cuda_malloc<d_companies_map> 0.09
cuda_malloc<d_indicatorsRatings> 0.08
kernel<computeRatingsKernel> 0.6
kernelSegment<sort> 0.6
toHostMemory<d_ratings> 0.8
toHostMemory<d_companies_map> 0.59
toHostMemory<d_indicatorsRatings> 8.51
ranking 1.17
TotalRankingTime 12.84

GRID K520 :  797.000 Mhz   (Ordinal 0)
8 SMs enabled. Compute Capability sm_30
FreeMem:   3741MB   TotalMem:   4096MB   64-bit pointers.
Mem Clock: 2500.000 Mhz x 256 bits   (160.0 GB/s)
ECC Disabled


